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The packet routing problem plays an essential role in communication networks. It involves 
how to transfer data from some origins to some destinations within a reasonable amount 
of time. In the (£, fc)-routing problem, each node can send at most £ packets and receive 
at most k packets. Permutation routing is the particular case £ = k — 1. In the r-central 
routing problem, all nodes at distance at most r from a fixed node v want to send a packet 
to v. 

In this article we study the permutation routing, the r-central routing and the general 
(£, fc)-routing problems on plane grids, that is square grids, triangular grids and hexagonal 
grids. We use the store- and-forward A-port model, and we consider both full and half- 
duplex networks. We first survey the existing results in the literature about packet routing, 
with special emphasis on (£, fc)-routing on plane grids. Our main contributions are the 
following: 

1. Tight permutation routing algorithms on full-duplex hexagonal grids, and half du- 
plex triangular and hexagonal grids. 

2. Tight r-central routing algorithms on triangular and hexagonal grids. 

3. Tight (k, fc)-routing algorithms on square, triangular and hexagonal grids. 

4. Good approximation algorithms (in terms of running time) for (£, fc)-routing on 
square, triangular and hexagonal grids, together with new lower bounds on the running 
time of any algorithm using shortest path routing. 

These algorithms are all completely distributed, i.e., can be implemented indepen- 
dently at each node. Finally, we also formulate the (£, fc)-routing problem as a Weighted 
Edge Coloring problem on bipartite graphs. 
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1. Introduction 

In telecommunication networks, it is essential to be able to route communications as 
quickly as possible. In this context, the packet routing problem plays a capital role. 
In this problem we are given a network and a set of packets to be routed through the 
nodes and the edges of the network graph. A packet is characterized by its origin and 
its destination. We suppose that an edge can be used by no more than one packet at 
the same time. The objective is to find an algorithm to compute a schedule to route 
all packets while minimizing the total delivery time. This problem has been widely 
studie d in th e literature under many different assumptions. In 1988, in their seminal 
article EM, Leighton, Maggs and Rao proved the existence of a schedule for routing 
any set of packets with edge-simple paths on a general network, in optimal time of 
0(C + D) steps. Here C is the congestion (maximum number of paths sharing an 
edge) and D the dilation (length of the longest path) and it is assumed that the 
paths are given a priori. The proof of used Lovasz Local Lemma and was non 
constructive. This result was further improved in ^Sl where the same authors gave 
an explicit algorithm, using the Beck's constructive version of the Local Lemma. 

These algorithms to compute the optimal schedule are centralized. Then in 
Ostrovsky and Rabani gave a distributed randomized algorithm running in 0(C + 
D + log 1+e (n)) steps (see Section [1.11 for more references). 

Although these results are asymptotically tight, they deal with a general network, 
and in many cases it is possible to design more efficient algorithms by looking at 
specific packet configurations or network topologies. For instance, is it natural to 
bound the maximum number of messages that a node can send or receive. We focus 
on this point in Section 11.21 where we will formally define the problem studied in 
this paper. 

On the other hand, the network considered plays a major role on the quality and the 
simplicity of the solution. For example, in a radio wireless environment, cellular net- 
works are usually modeled by a hexagonal grid where nodes represent base stations. 
The cells of the hexagonal grids have good diameter to area ratio and still have a 
simple structure. If centers of neighboring cells are connected, the resulting graph is 
called a triangular grid. Notice that hexagonal grids are subgraphs of the triangular 
grid. We will talk about such networks in Section 11.31 In this paper we focus on 
the study of the (£, fc)-routing problem in convex subgraphs, i.e., subgraphs of the 
square, triangular and hexagonal grids which contain all shortest paths between all 
pairs of nodes. 
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1.1. General Results on Packet Routing 

In this section we provide a fast overview of the state-of-the-art of the general packet 
routing problem, in both the off-line and on-line settings in Sections II. 1.21 and 1 1.1. 31 
respectively, focusing mostly on the later. We begin by recalling three classical lower 
bounds for the packet routing problem. 



1.1.1. Classical lower bounds 

In the packet routing problem, there are three classical types of lower bounds for 
the running time of any algorithm: 

1. Distance bound: the longest distance over the paths of all packets (usually 
called dilation and denoted by D) constitutes a lower bound on the number 
of steps required to route all the packets. 

2. Congestion bound: the congestion of an edge of the network is defined as 
the number of paths using this edge. The greatest congestion over all the 
edges of the network (called congestion and denoted by C) is also a lower 
bound on the number of steps, since at each step an edge can be used by at 
most one packet. 

3. Bisection bound: Let G = (V, E) be the graph which models the network, 
and F C E be a cut-set disconnecting G into two components G\ and G<i- 
Let m be the number of packets with origin in G\ and destination in Gi- 
The number of routing steps used by any algorithm is at least jj^ 



1.1.2. Off-line routing 

Given a set of packets to be sent through a network, a path system is defined as the 
union of the paths that each packet must follow. For a general network and any set of 
n demands, we have seen in Section fl-l-ll that the dilation and the congestion provide 
two lower bounds for the routing time. This proves that the dilation + congestion of 
a paths system used for the routing procedure is a lower bound of twice the routing 
time. In a celebrated paper, Leighton, Maggs and Rao proved the following theorem: 

Theorem 1.1 (EH) For any set of requests and a path system for these requests, 
there is an off-line routing protocol that needs 0{C + D) steps to route all the 
requests, where C is the congestion and D is the dilation of the path system. 

In addition, in S3 the authors show that, given the set of packets to be sent, it 
is possible to find in polynomial time a path system with C + D within a factor 4 
of the optimum. Thus, Theorem 1 1 . 1 1 can be announced in a more general way: 

Theorem 1.2 (321) For any set of requests, there is an off-line routing protocol 
that needs 0{C + D) steps to route all the requests, where C + D is the minimum 
congestion + dilation over all the possible path systems. 
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Furthermore, this routing protocol uses fixed buffer size, i.e., the queue size at 
each is bounded by a constant at each step. Nevertheless, it is important to notice 
that a huge constant may be hidden inside the O notation. As we said before, this 
result was further improved in ^Sl where the same authors gave an explicit algorithm. 
These algorithms to compute the optimal schedule are centralized. In a distributed 
algorithm nodes must make their decisions independently, based on the packets they 
see, without the use of a centralized scheduler. In ^52 Ostrovsky and Rabani gave 
a distributed randomized algorithm running in 0{C + D + log 1+e (n)) steps. We 
refer to Scheideler's thesis ^ for a complete compilation of general packet routing 
algorithms. 



1.1.3. On-line routing 

In the on-line setting, the oldest on-line protocol that deviates only by a factor loga- 
rithmic in n from the best possible runtime 0{C + D) for arbitrary path-collections 
is the protocol presented by Leighton, Maggs and Rao in the same paper ^H, run- 
ning in 0(C + Dlog(Dn)) steps with high probability. This schedule assumes that 
the paths are given a priori, hence it does not consider the problem of choosing the 
paths to route the packets. 

The results of ^ provide a routing algorithm that is log n competitive with respect 
to the congestion. In other words, it is worse than an optimal off-line algorithm only 
by a factor logn. In this setting the demands arrive one by one and the algorithm 
routes calls based on the current congestion on the various links in the network, so 
this can be achieved only via centralized control and serializing the routing requests. 
In SI the authors gave a distributed algorithm that repeatedly scans the network so 
as to choose the routes. This algorithm requires shared variables on the edges of the 
network and hence is hard to implement. Note that the two on-line algorithms above 
depend on the demands and are therefore adaptive. Recall that an oblivious routing 
strategy is specified by a path system V and a function w assigning a weight to every 
path in V . This function w has the property that for every source-destination pair 
(s, t), the system of flow paths V St t for (s, t) fulfills X^e-p s t w ( < l) = ^- ® ne can think 
of this function as a frequency distribution among several paths going from an origin 
s to a destination t. In adaptive routing, however, the path taken by a packet may 
also depend on other packets or events taking place in the network during its travel. 
Remark that every oblivious routing strategy is obviously on-line and distributed. 

The first paper to perform a worst case theoretical analysis on oblivious routing 
is the paper of Valiant and Brebner I^H, who considered routing on specific network 
topologies such as the hypercube. They gave a randomized oblivious routing algo- 
rithm. Borodin and Hopcroft ^ and subsequently Kaklamanis, Krizanc, and Tsan- 
tilas ^221 showed that deterministic oblivious routing algorithms cannot approximate 
well the minimal load on any non-trivial network. 

In a recent paper, Racke'^gave the construction of a polylog competitive obliv- 
ious routing algorithm for general undirected networks. It seems truly surprising 
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that one can come close to minimal congestion without any information on the cur- 
rent load in the network. This result has been improved by Azar et al. Lower 
bounds on the competitive ratio of oblivious routing have been studied for various 
types of networks. For example, for the ci-dimensional mesh, Maggs et al. gave 
the w(^ !L (logn)) lower bound on the competitive ratio of an oblivious algorithm on 
the mesh, where C* is the optimal congestion. 

So far, the oblivious algorithms studied in the literature have focused on mini- 
mizing the congestion while ignoring the dilation. In fact, the quality of the paths 
should be determined by the congestion C and the dilation D. An open question 
is whether C and D can be controlled simultaneously. An appropriate parameter 
to capture how good is the dilation of a path system is the stretch, defined as the 
maximum over all packets of the ratio between the length of the path taken by the 
routing protocol and the length of a shortest path from source to destination. In 
a recent work, Bush et al.® considered again the case of the d-dimensional mesh. 
They presented an on-line algorithm in which C and D are both within 0{d 2 ) of the 
potential optimal, i.e., D = 0{d 2 D*) and CO{dC* log(n)), where is the optimal 
dilation. Note that by the results of Maggs et al. ^51, it is impossible to have a factor 
better than logn). 

There is a simple counter-example network that shows that in general the two 
metrics {dilation and congestion) are orthogonal to each other: take an adjacent 
pair of nodes u, v and 0(- v /re) disjoint paths of length 0(y / n) between u and v. For 
packets traveling from u to v, any routing algorithm that minimizes congestion has 
to use all the paths, however, in this way some packets follow long paths, givinghigh 
stretch. Nevertheless, in grids^, and in some special kind of geometric networks^the 
congestion is within a poly-logarithmic factor from optimal and stretch is constant 
(d the dimension). As mentioned before an interesting open problem is to find other 
classes of networks where the congestion and stretch are minimized simultaneously 
^. Possible candidates for such networks could be for example bounded-growth net- 
works, or networks whose nodes are uniformly distributed in closed polygons, which 
describe interesting cases of wireless networks. 

The recent paper of Maggs ^31 surveys a collection of theoretical results that relate 
the congestion and dilation of the paths taken by a set of packets in a network to 
the time required for their delivery. 

1.2. Routing Problems 

The initial and final positioning of the packets has a direct influence on the time 
needed for their routing. Considering static packet configuration, the most studied 
constraints refer to the maximum number of packets that a node can send and re- 
ceive. Due to their practical importance, some of these problems have specific names: 

1. Permutation routing: each node is the origin and the destination of at 
most one packet. To measure the routing capability of an interconnection 
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network, the partial permutation routing (PPR) problem is usually used as 
the metric. 

2. (£, &)-routing: each node is the origin of at most £ packets and destination 
of at most k packets. Permutation routing corresponds to the case £ = k = 1 
of (£, fc)-routing. Another important particular case is the (1, £;)-routing, in 
which each node sends at most one packet and receives at most k packets. 

3. (1, cmy)-routing: each node is the origin of at most one packet but there 
are no constraints on the number of packets that a node can receive. 

4. r-central routing: all nodes at distance at most r of a central node send 
one message to this central node. 

In all these problems, we are given an initial packet configuration and the objective 
is to route all packets to their respective destinations minimizing the total routing 
time, under the constraint that each edge can be used by at most one packet at the 
same time. 

Besides of the constraints about the initial and final positions of the packets, 
there also exist different routing models at the intermediate nodes of the network. 
For instance, in the hot potato model no packet can be stored at the nodes of the 
network, whereas in the store- and- forward at each step a packet can either stay at 
a node or move to an adjacent node. 

On the other hand, one can consider constraints on the number of incident edges 
that each node of the network can use to send or receive packets at the same time. In 
the A-port model each node can send or receive packets through all its incident 
edges at the same time. 

In this article we study the store-and-forward A-port model. In addition, we 
suppose that cohabitation of multiple packets at the same node is allowed. I.e., a 
queue is required for each outgoing edge at each node. 

The nature of the links of the network is another factor that influences the routing 
efficiency. The type of links is usually one of the following: full-duplex or half-duplex. 
In the full-duplex case there are two links between two adjacent nodes, one in each 
direction. Hence two packets can transit, one in each direction, simultaneously. In 
the half-duplex case only one packet can transit between two nodes, either in one 
direction of the edge or in the other. In this paper we study both half and full-duplex 
links. 

1.3. Topologies 

We now give a brief summary of various cases of (£, fc)-routing and (1, any)-routing 
that have been studied for several specific topologies. More precisely, in Section [1.3.1l 
we list some of the important results for some networks which have attracted interest 
in the literature, like hypercubes and circulant graphs. We move then to plane grids 
in Section 11.3.21 It is well known that there exist only three possible tessellations of 
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the plane into regular polygons : squares, triangles and hexagons. These graphs 
are those which we study in this article. 

1.3.1. Different network topologies 

Hwang, Yao, and Dasgupta studied the permutation routing problem in low- 
dimensional hypercubes (d < 12). They gave optimal or good-in-the- worst-case 
oblivious algorithms. Another network widely studied in the literature is the two 
dimensional mesh with row and column buses. This network can also be diversified 
according to the capacities of the buses. In^H, Suel gave a deterministic algorithm to 
solve the permutation routing problem in such networks. The algorithm provides a 
schedule using at most n + o(n) steps and queues of size two. He also proposed a de- 
terministic algorithm for r-dimensional arrays with buses working in (2— j,)n + o(n) 
steps and still using queues of size 2. In ^23, the authors studied the (£, £)-routing 
problem in the mesh grid with two diagonals and gave a deterministic algorithm 
using ^fj + (in 2 / 3 ) steps for £ > 9. 

In^^l, the authors introduced an algorithm called big foot algorithm. The idea of 
this algorithm is to identify two types of links and to move towards the destination 
using first the links of the first type and then those of the second type. The algorithms 
we develop will use such a strategy. They give an optimal centralized algorithm for 
the permutation routing problem in full-duplex 2-circulant graphs and double-loop 
networks. This later network is of great practical importance. It is modeled by a 
graph with vertex set V = {vq, . . . , i> n -i} such that there are two integers hi and /12 
such that the edge set is E = {viVi±h 1 , ViVi±h 2 }. The permutation routing problem 
in this network is studied by Dobravec, Robic, and Zerovnik H^l The authors gave 
an algorithm for the permutation routing problem which in mean uses 1.12£ steps 
(the mean being empirically measured). In ^1 the authors described an optimal 
centralized permutation routing algorithm in /s-circulant graphs (k > 2), and inE21 
an optimal distributed permutation routing in 2-circulant graphs was obtained. 

The problem has been also studied for packets arriving dynamically by Havil 

j n [18|^ w h ere an optimal online schedule for the linear array is given. Havil also gave 

a 2-approximation for rings and show that, us ing shortest path routing, no better 

10 -1 1 

approximation algorithm exists. Jan and Lin ^ studied Cube Connected Cycles 
CCC(n,2 n ). These are hypercubes of dimension n where each node is replaced by 
a cycle of length n. They gave an algorithm working in 0(n 2 ) with 0(1) buffers for 
the online partial permutation routing (PPR). 

1.3.2. Plane grids 

Maybe the most studied networks in the literature are the two dimensional grids 
(or plane grids), and among them in particular the square grid has deserved special 
attention. Let us briefly overview what has been previously done on (£, fc)-routing 
in plane grids. 
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Fig. 1. Hexagonal network (A) and hexagonal tessellation (O)- 



Leighton, Makedon, and Tollis obtained the first optimal permutation rout- 
ing with running time 2n — 2 and queues of size 1008. Rajasekaran and OverholtSI 
reduced the queue size to 112. Sibeyn, Chlebus, and Kaufmann S3 reduced this 
to 81. Furthermore, they provided another algorithm running in near-optimal time 
2n + 0(1) steps with a maximum queue size of only 12. Makedon and Symvonis^^ 
introduced the (1, /c)-routing and the (1, any)-routing problems and gave an asymp- 
totically optimal algorithm for (1, A;)-routing on plane grids, with queues of small 
constant size. This result was further improved by Sibeyn and Kaufman in S3 where 
they gave a near-optimal deterministic algorithm running in Vk^ + 0(n) steps. They 
gave another algorithm, slightly worse, in terms of number of steps, but with queues 
of size only 3. They also studied the general problem of (£, &)-routing in square 
grids. They proposed lower bounds and near-optimal randomized and deterministic 
algorithms. They finally extended these algorithms to higher dimensional meshes. 
They performed (£, ^-routing in 0(£n) steps, the lower bound being fi(V £kn) for 
(£, fc)-routing. Finally, Pietracaprina and Pucci gave deterministic and random- 
ized algorithms for (£, fc)-routing in square grids, with constant queue size. The 
running time is 0(y/£kn) steps, which is optimal according to the bound of S3. This 
work closed a gap in the literature, since optimal algorithms were only known for 
£ = 1 and £ = k. 

Nodes in a hexagonal network are placed at the vertices of a regular triangular 
tessellation, so that each node has up to six neighbors. In other words, a hexagonal 
network is a finite subgraph of the triangular grid. These networks have been studied 
in a variety of contexts, specially in wireless and interconnection networks. The most 
known application may be to model cellular networks with hexagonal networks where 
nodes are base stations. But th ese net works have been also applied in chemistry to 
model benzenoid hydrocarbons I 52 | 25 |^ in image processing and computer graphics 
[26] 

In a radiocommunication wireless environment ^3, the interconnection network 
among base stations constitutes a hexagonal network, i.e., a triangular grid, as it is 
shown in FlG. [TJ 

Tessellation of the plane with hexagons may be considered as the most natural 
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model of networks because cells have optimal diameter to area ratio. The triangular 
grid can also be obtained from the basic 4-mesh by adding NE to SW edges, which is 
called a 6-mesh in ^3. Here we study convex subgraphs, i.e., subgraphs that contain 
all shortest paths between all pairs of nodes, of the square, triangular and hexago- 
nal grids. Summarizing, to the best of our knowledge the only optimal algorithms 
concerning (£, fc)-routing on plane grids (according to the lower bound of^J) have 
been found on square grids, but modulo a constant factor ^Sl. On triangular and 
hexagonal grids, the best results are randomized algorithms with good performance 

M 



1.4. Our Contribution 

In this paper we study the permutation routing, r-central and (£, fc)-routing problems 
on plane grids, that is square grids, triangular grids and hexagonal grids. We use 
the store- and- forward A-port model, and we consider both full and half-duplex 
networks. 

We have seen in Section 11.3.21 that the only plane grid for which there existed 
an optimal (£, /c)-routing is the square grid. In addition, these articles concerning 
(£, fe)-routing in plane grids are optimal modulo a constant factor. In this paper we 
improve these results by giving tight algorithms including the constant factor, in 
the cases of square, triangular and hexagonal grids. It is important to stress that 
all the algorithms presented in this paper except the one given in Appendix B are 
distributed. Our algorithms only use shortest paths, therefore they achieve minimum 
stretch. In addition, the algorithms are oblivious, so they can be used in an on-line 
scenario. However the performance guarantees that we prove apply only to the off- 
line case. The new results are the following: 

1. Tight (also including the constant factor) permutation routing algorithms in 
full-duplex hexagonal grids, and half duplex triangular and hexagonal grids. 

2. Tight (also including the constant factor) r-central routing algorithms in 
triangular and hexagonal grids. 

3. Tight (also including the constant factor) [k, fc)-routing algorithms in square, 
triangular and hexagonal grids. 

4. Good approximation algorithms for (£, /c)-routing in square, triangular and 
hexagonal grids. 



This paper is structured as follows. In Section [2] we study the permutation routing 
problem. Although permutation routing had already been solved for square grids, 
we begin in Section \2. II by illustrating our algorithm for such grids. Then in Section 
12.21 we give tight permutation routing algorithm for half-duplex triangular grids, 
using the optimal algorithm of S3. In Section 12.31 we provide a tight permutation 
routing algorithm for full-duplex hexagonal grids and a tight permutation routing 
algorithm for half-duplex hexagonal grids. In Section [3] we focus on (1, any )-routing, 
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giving an optimal r-central routing algorithms for the three types of grids. We finally 
move in Section [4] to the general (£, /c)-routing problem. We provide a distributed 
algorithm for (£, A;)-routing in any grid, using the ideas of the optimal algorithm for 
permutation routing. We also prove lower bounds for the worst-case running time of 
any algorithm using shortest path routing. In addition, these lower bounds allow us 
to prove that our algorithm turns out to be tight when £ = k, yielding in this way a 
tight (fc, fc)-routing algorithm in square, triangular and hexagonal grids. We propose 
in Appendix B an approach to (£, /c)-routing in terms of a graph coloring problem: 



the Weighted Bipartite Edge Coloring. We give a centralized algorithm using 
this reduction. 



2. Permutation Routing 

As we have already said in Section [H in the permutation routing problem, each 
processor is the origin of at most one packet and the destination of no more than 
one packet. The goal is to minimize the number of time steps required to route 
all packets to their respective destinations. It corresponds to the case £ = k = 1 
of the general (£, /c)-routing problem. This problem has been studied in a wide 
diversity of scenarios, such as Mobile Ad Hoc Networks ^31, Cube-Connected Cycle 
(CCC) Networks EI, Wireless and Radio Networks 1° , All-Optical Networks ^3 and 
Reconfigurable Meshes^. 

In a grid with full-duplex links an edge can be crossed simultaneously by two 
messages, one in each direction. Equivalently, each edge between two nodes u and v 
is made of two independent arcs uv and vu, as illustrated in Fig. [2k. 




a) 



b) 



Fig. 2. a) Each edge consists of two independent links, b) Axis used in a triangular grid. 



Remark 2.1. If the network is half-duplex, it is easy to construct a 2- 
approximation algorithm from an optimal algorithm for the full-duplex case by 
introducing odd- even steps, as explained for example in 031. 
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2.1. Square grid 

Many communication networks are represented by graphs satisfying the following 
property: for any pair of nodes u and v, the edges of a shortest path from u to v 
can be partitioned into k disjoint classes according to a well-defined criterium. For 
instance, on a triangular grid the edges of a shortest path can be partitioned into 
positive and negative ones S21 Similarly, on a /c-circulant graph the edges can be 
partitioned into k classes according to their length. 

In graphs that satisfy this property there exists a natural routing algorithm: 
route all packets along one class of edg es after another. For hexagonal networks this 
algorithm turns out to be optimal Optimality for 2-circulant graphs is proved 
using a static approach in and recently using a dynamic distributed algorithm 
in In US the authors introduce the notion of big-foot algorithms because their 
algorithm routes packets first along long hops and then along short hops in a 2- 
circulant graph. 

On the square grid, the big-foot algorithm consists of two phases, moving each 
packet first horizontally and then vertically. In this way a packet may wait only dur- 
ing the second phase. Using the fact that all destinations are distinct, the optimality 
for square grid is easy to prove. Summarizing, it can be proved that 

Theorem 2.1. There is a translation invariant oblivious optimal permutation rout- 
ing algorithm for full-duplex networks that are convex subgraphs of the infinite 
square grid. 

2.1.1. Regarding the queue size 

Of course, this is not the first optimal permutation routing result on square grids, 
as the classical x — y routing (first route packets through the horizontal axis, and 
then through the vertical axis) has been used for a long time. Thus, another more 
challenging issue is to reduce the queue size, as we have already discussed in Section 
11.3.21 Leighton describes inEOla simple off-line algorithm for solving any permutation 
routing problem in 3n — 3 steps on a n x n square grid, using queues of size one. 
Since the diameter of a n x n square grid is 2n — 2, this algorithm provides a |- 
approximation. The main drawback is that this algorithm is off-line and centralized. 
In contrast, our oblivious distributed algorithm is optimal in terms of running time, 
but it is easy to see that on a n x n square grid, the queue size can be Up 
to date, the best algorithm running in optimal time to route permutation routing 
instances on square grids is the algorithm of Sibeyn et al. SS using queues of size 
81. So far, there is no algorithm that guarantees optimal running time with queues 
of size 1, and it is unlikely that such an algorithm exists. 

Remark 2.2. The same observation regarding the unbounded queue size applies 
to all the algorithms described in this article. However, our aim is to match the 
optimal running time, rather than minimizing the queue size. Additionally, it turns 
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out that some appropriate modifications of the permutation routing algorithms that 
we provide for plane grids allow us to find oblivious algorithms which route any 
permutation within a factor 3 of the optimal running time, and using queues of size 
1 (in fact, we can say something stronger: we just need memory to keep 1 message 
at each node). We do not describe these modifications in this article. 



2.2. Triangular grid 

We use the addressing scheme introduced in S3 and used also in E31 we represent 
any address on a basis consisting of three unitary vectors i, j, k on the directions of 
three axis x, y, z with a 120 degree angle among them, intersecting on an arbitrary 
(but fixed) node O . This node is the origin and is given the address O = (0,0,0). 
This basis is represented in Fig. (2Jd. Thus, we can assume that each node P £ V 
is labeled with an address P = (P\, P2, P3) expressed in this basis {i, j, k} with 
respect to the origin O. At the beginning, each node S knows the address of the 
destinat ion node D of the message placed initially at S, and computes the relative 
address SD = D — S of the message. Note that this relative address does not depend 
on the choice of the origin node O. This relative address is the only information that 
is added in the heading of the message to be transmitted, constituting in this way 
the packet to be sent through the network. 

Using that i + j + k = 0, it is easy to see that if (a, b, c) and (a', b', d) are the 
relative addresses of two packets, then (a, b, c) = (a', b' , c') if and only if there exists 
d G Z such that a' = a + d, b' = b + d, and c' = c + d. 

We say that an address SD = (a, b, c) is of the shortest path form if there is a 
path from node S to node D, consisting of a units of vector i, b units of vector j 
and c units of vector k, and this path has the shortest length. 

Theorem 2.2 d*^j) An address (a, b, c) is of the shortest path form if and only if 
at least one component is zero, and the two other components do not have the same 
sign. 

Corollary 2.1 (j^^) Any address has a unique shortest path form. 

Thus, each address SD written in the shortest path form has at most two non-zero 
components, and they have different signs. In fact, it is easy to find the shortest 
path form using the next result. 

Theorem 2.3 (S3) // SD = ai + bj + ck, then 

\SD\ = min(|a — c| + \b — c|, \a — b\ + \b — c\, \a — b\ + \a — c|). 

Permutation routing on full-duplex triangular grids has been solved recently^! 
attaining the distance lower bound of £ mSuX routing steps, where £ max is the maximum 
length over the shortest paths of all packets to be sent through the network. 

As said in Remark 12.11 if the network is half-duplex, one can construct a 2- 
approximation algorithm from an optimal algorithm for the full-duplex case by 
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introducing odd-even steps. Thus, using this algorithm we obtain an upper bound 
of 2£ max for half-duplex triangular grids. 

Let us show with an example that this naive algorithm is tight. That is, we shall 
give an instance running steps, implying that no better 

algorithm for a general instance exists. Indeed, consider a set of nodes distributed 
along a line on the triangular grid. We fix ^ max and an edge e on this line, and put 
^max packets at each side of e along the line, at distance at most ^ max — 1 from 
an end- vertex of e. For each packet, each destination is chosen on the other side 
of e with respect to its origin, at distance exactly ^ max from the origin. It is easy 
to check that the congestion of e (that is, the number of shortest paths containing 
e) is 2£ m3X , and thus any algorithm using shortest path routing cannot perform in 
less than 2£ max steps. On the other hand, ^ max is a lower bound for any distance, 
yielding that the approximation ratio of our algorithm is at most 2. 

Previous observations allow us to state the next result: 

Theorem 2.4. There exists a tight permutation routing algorithm for half-duplex 
triangular grids performing in at most 2^ max steps, where £ max is the maximum 
length over the shortest paths of all packets to be sent. This algorithm is a 2- 
approximation algorithm for a general instance. 

2.3. Hexagonal grid 

In a hexagonal grid one can define three types of zigzag chains ^3, represented with 
thick lines in Fig. O Similarly to the triangular grid, in the hexagonal grid any 
shortest path between two nodes uses at most two types of zigzag chains Let us 
now give a lower bound for the running time of any algorithm. Consider the edge 
labeled as e in Fig. O and the two chains containing it (those shaping an X). Fix 
^max and e, and put one message on all nodes placed at both chains at distance at 
most £ max — 1 from an endvertex of e. As in the case of the triangular grid, choose 
the destinations to be placed on the other side of e along the same zigzag chain 
than the originating node, at distance exactly £ max from it. It is clear that all the 
shortest paths contain e. It is also easy to check that the congestion of e is 4£ max — 4 
in this case, constituted of symmetric loads 2£ max — 2 in each direction of e. Thus, 
2^max — 2 establishes a lower bound for the running time of any algorithm in the 
full-duplex case, whereas 4£ max — 4 is a lower bound for the half-duplex case, under 
the assumption of shortest path routing. 

Let us now describe a routing algorithm which reaches this bound. We have three 
types of edges according to the angle that they form with any fixed edge. Each edge 
belongs to exactly two different chains, and conversely each chain is made of two 
types of edges. Moreover, in an infinite hexagonal grid any two chains of different 
type intersect exactly on one edge. 

Given a pair of origin and destination nodes S and D, it is possible to express 
the relative address D — S counting the number of steps used by a shortest paths on 
each type of chain. In this way we obtain an address D — S = (a,b, c) on a generating 
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Fig. 3. Zigzag chains in a hexagonal grid. 

system made of unitary vectors following the directions of the three types of chains 
(it is not a basis in the strict sense, since these vectors are not linearly independent 
on the plane. However, we will call it so). Choose this basis so that the three vectors 
form angles of 120 degrees among them. As it happ ens on the triangular grid I2Z|4H ) 
there are at most two non-zero components (see^J), and in that case they must have 
different sign. Nevertheless, in this case, the address is not unique, since an edge 
placed at the bent (that is, a change from a type of chain to another) of a shortest 
path is part of both types of chains. Anyway, this ambiguity is not a problem in the 
algorithm we propose, as we will see below. 

Suppose first that edges are bidirectional or, said otherwise, full-duplex. Roughly, 
the idea is to use the optimal algorithm for triangular grids described in SH, and 
adapt it to hexagonal grids. For that purpose we label the three types of zigzag 
chains c±,C2,cs, and the three types of edges ei,e2,e^. Without loss of generality, 
we label them in such a way that c\ uses edges of type e<i and e3, C2 uses e\ and e%, 
and C3 uses e\ and e2 (see Fig. SJ). 




Fig. 4. 3 types of chains and edges in a hexagonal grid. 

For each type of edge, we define two phases according to the type of chain that 
uses this type of edge. This defines two global phases, namely: during Phase 1, c\ 
uses e2, C2 uses es, and C3 uses e±. Conversely, during Phase 2 c\ uses e$, C2 uses e±, 
and C3 uses e2- 

We suppose that at each node packets are grouped into distinct queues according 
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to the next edge (according to the rules of the algorithm) along its shortest path. 
Given the relative addresses D — S in the form (a, b, c), the algorithm can be de- 
scribed as follows. 

At each node u of the network: 

1) During the first step, move all packets along the direction of their negative 
component. If a packet's address has only a positive component, move it 
along this direction. 

2) From now on, change alternatively between Phase 1 and Phase 2. At each 
step (the same for both phases): 

a) If there are packets with negative components, send them immediately 
along the direction of this component. 

b) If not, for each outgoing edge order the packets in decreasing number 
of remaining steps, and send the first packet of each queue. 

3) If at some point, all the packets in u have remaining distance one, send them 
immediately. 

Let us analyze the correctness and optimality of this algorithm. 

In 1) all packets can move, since initially there is at most one packet at each 
node. In 2a), there can only be one packet with negative component at each outgoing 
edge EH T n 2b) the packet with maximum remaining length at each outgoing edge is 
unique. Indeed all these packets are moving along their last direction (their negative 
component is already finished, otherwise they would be in 2a)) and each node is the 
destination of at most one packet. Hence, using this algorithm, every 2 steps (one 
of Phase 1 and one of Phase 2) the maximum remaining distance over all packets 
decreases by one. Moreover, during the first step all packets decrease their remaining 
distance by one. Because of this, after the (2£ max — 3)th step the maximum remaining 
distance has decreased at least 1 + 2 ^™~ 4 = £ max — 1 times, hence the maximum 
remaining distance is 1, and we are in 3). Since all destinations are different, all 
packets can reach simultaneously their destinations. Thus, the total running time is 
at most 2£ max -3 + l = 21 max — 2, meeting the worst case lower bound. Again, ^ max is 
a lower bound for any instance, hence the algorithm constitutes a 2-approximation 
for a general instance. 

Theorem 2.5. There exists a tight permutation routing algorithm for full-duplex 
hexagonal grids performing in 2£ max — 2 steps, where £ max is the maximum length 
over the shortest paths of all packets to be sent. 

Remark 2.3. The optimality stated in Theorem 12.51 is true only under the as- 
sumption of shortest path routing. This means that for certain traffic instances the 
total deliver time may be shorter if some packets do not go through their shortest 
path. To illustrate this phenomenon, consider the example of Fig. [5j Node labeled i 
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wants to send a message to node labeled i', for i = 1, . . . , 8. We have that -£ max = 5, 
and thus our algorithm performs in 2£ max — 2 = 8 steps. It is clear that all shortest 
paths use edge e, and its congestion bottlenecks the running time. Suppose now that 
we route the messages originating at even nodes through the path defined by the 
edges {a,b,c,d}, instead of {/, e}, and keep the shortest path routing for messages 
originating at odd nodes. One can check that with this routing only 7 steps are 
required. 




Fig. 5. Shortest path is not always the best choice. 



In the half-duplex case, just introduce again odd-even steps in both phases. 
Thus, we have Phase 1-even, Phase 1-odd, Phase 2-even, and Phase 2-odd, which 
take place sequentially. Now, 1) consists obviously of two steps (even/odd). Using 
this algorithm, every 4 steps the maximum remaining distance decreases by one. In 
addition, during the first 2 steps and during the last 2 steps all packets decrease 
their remaining distance by one. Thus, the total running time is at most twice the 
time of the full-duplex case, that is 2(2£ max — 2) = 4£ max — 4 steps, meeting again 
the lower bound for the running time of any routing algorithm using shortest path 
routing. Again, this algorithm constitutes a 4-approximation for a general instance. 

Theorem 2.6. There exists an tight permutation routing algorithm for half-duplex 
hexagonal grids performing in 4£ max — 4 steps, where £ max is the maximum length 
over the shortest paths of all packets to be sent. 



Remark 2.4. As explained in Appendix A, there exists an embedding of the trian- 



gular grid into the hexagonal grid with load, dilation, and congestion 2. Using this 
embedding, any algorithm performing on k steps on the triangular gird performs on 
2k steps on the hexagonal grid. Using this fact, we obtain a permutation routing 
algorithm on full-duplex hexagonal grids performing on 2£ max steps. Note that the 
optimal result given in Theorem 12.51 is slightly better. 

The same applies to half-duplex hexagonal networks, with a running time of 
4-^max using the embedding, in comparison to 4£ max — 4 steps given by Theorem [ 
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3. (1, am/)-Routing 

In this case the routing model is the following: each packet has at most one packet 
to send, but there are no constraints on the destination. That is, in the worst case 
all packets can be sent to one node. This special case where all packets want to 
send a message to the same node in often called gathering in the literature Notice 
that this routing model is conceptually different from the (1, /c)-routing, where the 
maximum number of packets that a node can receive is fixed a priori. 

Square grid Assume first that edges are bidirectional. The modifications for the 
half-duplex case are similar to those explained in the previous section. 

We will focus on the case where all packets surrounding a given vertex want to 
send a packet to that vertex. We call this situation central routing, and if we want 
to specify that all nodes at distance at most r from the center want to send a packet, 
we note it as r- central routing. Note that this situation is realistic in many practical 
applications, since the central vertex can play the role of a router or a gateway in a 
local network. 

Lemma 3.1 (Lower Bound) The number of steps required in a r-central routing 
is at least ( r '£ 1 ) • 

Proof. Let us use the bisection bound ^ to prove the result. It is easy to count 
the number of points at distance at most r from the center, which is 4( r ^ 1 ). Now 
consider the cut consisting of the four edges outgoing from the central vertex. All 
packets must traverse one of these edges to arrive to the central vertex. This cut 
gives the bisection bound of 4( r ~^ 1 ) /4 routing steps. □ 

Let us now describe an algorithm meeting the lower bound. 

Proposition 3.1. There exists an optimal r-central routing algorithm on square 
grids performing in ) routing steps. 

Proof. Express each node address in terms of the relative address with respect to 
the central vertex. In this way each node is given a label (a, b). Then, for each packet 
placed in a node with label (a, b) our routing algorithm performs the following: 

• If ab = 0, send the packet along the direction of the non-zero component. 

• If ab > 0, send the packet along the vertical axis. 

• If ab < 0, send the packet along the horizontal axis. 

Queues are managed so that the packets having greater remaining distance have 
priority. 

This routing divides the square grid into 4 subregions surrounding the central 
vertex, as shown in Fig. The type of routing performed in each subregion is 
symbolized by an arrow. 
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Fig. 6. Division of the grid in the proof of Proposition 13.1 



Let us now compute the running time in the r-central case. It is obvious that 
using this algorithm all packets are sent to the 4 axis outgoing from the central 
vertex. The congestion of the edge in the axis containing the central vertex along 
each line is l + 2 + 3 + ... + r = ( r ^)- Since at each step one packet reaches its 
destination along each line, we conclude that f^ 1 ) is the total running time of the 
algorithm. □ 

Triangular grid The same idea applies to the triangular grid. In this case, the 
number of nodes at distance at most r is 6( r "^" 1 ) . The cut is made of 6 edges. Dividing 
the plane onto 6 subregions gives again an optimal algorithm performing in ( r ^ 1 ) 
steps. 

Hexagonal grid The same idea gives an optimal routing in the r-central case. In 
this case the degree of each vertex is three, and then it is easy to check (maybe a 
drawing using Fig. [3] can help) that there are ^{ r ^ 1 ) nodes at distance at most d 
that may want to send a message to the central vertex, and the cut has size 3. As 
expected, the running time is again ( r ^ 1 )- 

4. (£, fc)-Routing 

Recall that in the general (I, fc)-routing problem each node sends at most I packets 
and receives at most k packets. We propose a distributed approximation algorithm 
using the ideas of the previous algorithms for the permutation routing problem. We 
also provide lower bounds for the running time of any algorithm using shortest path 
routing, that allow us to prove that our algorithm is tight when t = k, on any grid. 

We start by describing the results for full-duplex triangular grids. (The results 
can be adapted to square grids.) We also show how to adapt the results to hexagonal 
grids and to the half-duplex version. In this section we denote c := max ff' fc | = 

["max-f^, . Note that c > 1. Lemma [4.1l and Lemma [4.21 provide two lower bounds 
for the running time of any algorithm using shortest paths. 



Lemma 4.1 (First lower bound) The worst-case running time of any algorithm 
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for (£, k)-routing on full-duplex triangular grids using shortest path routing satisfies 



Running time > min{^, k} 



Proof. Consider a set of £ max nodes placed along a line, placed consecutively at 
one side of a distinguished edge e. Each node wants to send mm{£, k} messages to 
the nodes placed at the other side of e along the line, at distance £ max from it. Then 
the congestion of e is min{£, k} ■ £ max , giving the bound. □ 

Definition 4.1. Given a vertex v, we call the rectangle of side (a, b) starting at v 
the set R v a b = {v + ai +{3j , < a < a, < f3 < b}. We call such a rectangle a square 
if a = b. Notice that in the triangular grid the node set is generated by {i,j,k}, 
where k = — i — j, as we have explained in Section 12.21 

Using standard graph terminology, given a graph G = (V, E) and a subset SCI', 
the set T(S) denotes the (open) neighborhood in G of the vertices in S. The following 
theorem can be found, for example, in 021. 

Theorem 4.1 (Corollary of Hall's theorem Let G = (V,E) be a bipartite 
graph, with V = X U Y. If for all subsets A of X, \T(A)\ > c\A\, then for each 
x £ X, there exists S x C Y such that \S X \ = c, and \/x,x' £ X , S x D S x > = and 
S x CT(x). 

We use this theorem to prove the following lower bound. 

Lemma 4.2 (Second lower bound) The worst-case running time of any algo- 
rithm for (£,k) -routing on full-duplex triangular grids using shortest path routing 
satisfies 



Running time > 



max{£, k} 



+ 1 



v / c~Tl 



where c 



max{£,fc} 
min{^,fc} 



Proof. Suppose without loss of generality that £ > k, otherwise change the role of 



£ and k. Let v be a vertex, and consider the square R v dd -, with d := 
claim that all nodes inside this square can send £ messages such that a 



We 



1 destination 



nodes are in the destination set D 



R v 



\ R v dd - Let S be the subgrid 
generated by positive linear combinations of the vectors i and j. More precisely, 
S := {v + ai + (3j , a > 0, (3 > 0}. Fig. [2b gives a graphical illustration. 

To prove this, we consider a bipartite graph H on vertex set R dd U D, with an 
edge between a vertex of R d d and a vertex of D if they are at distance at most f max 
in S. To apply Theorem 14.11 we have to show that any subset of vertices A C R d d 
has at least c\A\ neighbors in H. Theorem 14.11 will then ensure the existence of a 
feasible repartition of the messages from vertices of R d d to those of D such that all 
shortest paths have length at most ^ max - 
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Given A C R dd , let us call Da := {u £ D : dists(A,u) < £ max }, where 
dists(A, u) means the minimum distance in S from any vertex of A to the vertex u. 
For any A <Z R v dd , we need to show that 

\D A \>c\A\ (4.1) 

Without loss of generality we suppose that A is maximal, in the sense that there 
is no set A' strictly containing A with Da = Da>- Instead of considering all possible 
sets A, we will show below that we can restrict ourselves to rectangles. Hence given 
a set A, we denote by Ra the smallest rectangle containing the subset of vertices A. 
We first claim that 

\D Ra \D a \<\Ra\A\ (4.2) 

Indeed, this equality can be shown by induction on |.R>i\A|. For = the 

equality is trivial. Suppose that it is true for \Ra\ A\. The induction step consists in 
showing that there is an element x in Ra\A such that | D Ra \Dau{x} I — I D Ra \D a | < 1 
(note that D Rau{x} = D Ra ): 

• If there exists x such that x + j and x — i are in A and x — j is not in A, then 
we select this x. From x the only new vertex we may add to Da is x + ^ m ax*- 

• Otherwise, if there exists x such that x — j and x — i are in A and x + i is 
not in A, then we select this x. In this case the only new vertex we may add 
to D A is x - £ max k. 

• If none of the previous cases holds, since Ra is the smallest rectangle con- 
taining A, and A is maximal, then necessarily there exists an x such that 
x + i and x — j are in A and x — i is not in A. We select this x, and the only 
new vertex we may add to Da is x + £ max j. 

Thus, in all cases there exists an x adding at most one neighbor to D Ra \Da, which 
finishes the induction step and proves Equation (|4,2p . To finish the proof of the 
fact that we can restrict ourselves to rectangles, we show that, for any subset A, 
if Inequality (|4.ip holds for Ra, then it also holds for A. Indeed, Inequality flUD 
applied to Ra gives: 

c|-Ra| < IARaI j which is equivalent to 
c(\A\ + \R A \A\)< \D A \ + \D Ra \ D A \ (4.3) 

Using Inequality (|4.2p and the fact that c > 1, Inequality (|4.3p clearly implies 
that Inequality (|4.ip holds. 

Henceforth we assume that A is a rectangle. The last simplification consists in 
proving that we can restrict ourselves to rectangles containing v. In other words, it 
will be sufficient to prove Inequality (|4.ip for all rectangles R v ab - Given a rectangle 



(£,k)-Routing on Plane Grids 21 



R not positioned at v, the rectangle R' of the same size positioned at v has less 
neighbors, hence if Inequality (|4.ip holds for R' , it also holds for R. 

Finally let us prove that Inequality (|4,ip holds for all rectangles R v a b , with 1 < 
a,b < d. We have \R" J = ah and \Dr» I = (a + £ ma x)(b + Anax) — d 2 . By the choice 

of d, starting from the Inequality d < - m ^. 1 and using that 1 < a, b, one obtains 
that d 2 c < (^max + a) (^max + ^) — d 2 for any 1 < a, b < d. This implies, using a,b < d, 
that cab < (a + ^ m ax)(H^max) — d 2 for any 1 < a, b < d, hence Inequality (|4.ip (i.e. 
c\Rl b \ < \D K J) holds. 

So by Theorem 14. H each one of the d 2 nodes in R v d d can send t messages to the 
nodes of D. Since the number of edges going from R v d d to D is 4d — 1, we apply the 
bisection bound discussed in Section II. 1.11 to conclude that there is an edge of the 
border of the square R dd with congestion at least > \nt~\ - This finishes the 

proof of the lemma. □ 

We observe that this second lower bound is strictly better than the first one if 
and only if 



Vc+l Cax + 1 

If both c and ^ max are big, the condition becomes approximately: 

maxjl, k} i6 
min{^, k} 

That is, the second lower bound is better when the difference between i and k 
is big. This is the case of broadcast or gathering, where messages are originated (or 
destined) from (or to) a small set of nodes of the network. 

The two lower bounds can be combined to give: 

Lemma 4.3 (Combined lower bound) The worst-case running time of any al- 
gorithm for (I, k) -routing on full-duplex triangular grids using shortest path routing 
satisfies 

Running time > max ( ^ max • min{£, k}, maxjf, k] ■ m ^ x = : 
V |_4Vc + l_ 

4v^+l") 

Now we provide an algorithm from which we derive an upper bound. 

Proposition 4.1 (Upper bound (algorithm)) The algorithm for (£,k) -routing 
on full-duplex triangular grids is the following: route all packets as in the permutation 
routing case. That is, at each node send packets first in their negative component, 



max{£,k} 

£ max • max mm{£, k\, 
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breaking ties arbitrarily (there can be £ packets in conflict in a negative component). 
If there are no packets with negative components, send any of the ( at most k ) packets 
with maximum remaining distance. 

mm{£, k} ■ + m&x{£, k} ■ (£ max - c + 1) , if c < £ m3jX 

mm{£, k} ■ ,ifc> £ max 



Running time < ^ ' 2 



Proof. Suppose again without loss of generality that £ > k. We proceed by decreas- 
ing induction on ^ max . We prove that after min{^, £ max k} steps, each packet will be 
at distance at most £ max — 1 of its destination. This yields 

Running time(i max ) < min{£, £ max k} + Running time(£ max - 1) 

imn{£, k} ■ + max{l, k} ■ (£ max - c) , if c < £ max - 1 

min {£ ! k } . W(W-l) j if c > w _ 1 



< minR £ max k} + { ^' "* " . , , 



< 



min{^, A;} • -^-g — - + maxjf, fe} • (^ max — c+l) , if c< 

min{£, fc} • W( ^ ax+1) , if c> Cax 



Let us consider the messages at distance £ max to their destinations. They are 
of two types, the one moving according to their negative component and the one 
moving according to their positive component. 

If c < ^ max the first ones move after at most £ time steps. If c < £ max they move 
more quickly, indeed they move at least once every £ max k steps (£ ma , x k < c • k = £). 
This is due to the fact that when c < ^ max at a given vertex, at most £ ma xk messages 
may have to move according to their negative component toward a node at distance 

About the messages which move according to their positive component, since a 
node is the destination of at most k messages, they may wait at most k steps. 

Consequently, ^ max decreases by at least one every min{^, £ max k] steps, which 
gives the result. □ 

This gives an algorithm which is fully distributed. Dividing the running time of 
this algorithm by the combined lower bound we obtain the following ratio: 



i{l,fc}-Q+max{l,fc}-(l max -c+l) 
max-maxl mm{l,k\ , 4 / ' J I 
min{l,fc}-(l majc +l) C 



if r < f 



if C > 



2.max(min{<,fc} , 

We observe that in all cases the running time of the algorithm is at most 
max{£, k} ■ £ max . In particular, when £ = k (that is, c = 1) the running time is 
exactly max{£, k} ■ £ max = mm{£, k} ■ £ max , and therefore it is tight (see lower bound 
of Lemma 14. If) . 

Corollary 4.1. There exists a tight algorithm for (k,k)-routing in full- duplex tri- 
angular grids. 
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The previous algorithms can be generalized for half-duplex triangular grids as 
well as for full and half-duplex hexagonal grids. The generalization to half-duplex 
grids is obtained by just adding a factor 2 in both the lower bound and the running 
time of the algorithm, as we did for the permutation routing algorithm. Thus, let us 
just focus on the case of full-duplex hexagonal grids, for which we have the following 
theorems: 

Theorem 4.2. There exists an algorithm for (£, A:)-routing in full-duplex hexagonal 
grids whose running time is at most: 



max 



D . .. . j 2mm{£,k}-^^ + 2m&x{£,k}-(£ max -c+l) , if c < 
Runninq time < < z , e A\ 

~ \ 2min{£, k} ■ hssi^tn , if c > ^ max 

Lemma 4.4 (First lower bound) No algorithm based on shortest path routing 
can route all messages using less than 2 min{^, k} ■ £ ma _ x — mm{£, k} steps in the 
worst case. 

Definition 4.2. Given a vertex v, we call the rectangle of the hexagonal grid of side 
(a, b) starting at v to the subset of the hexagonal grid R V h exa b = {v+ai+f3j+~/k, < 
a < a, —7 < [3 < b, < j < b} n H where H is the vertex set of the hexagonal grid. 
We call such a rectangle a square if a = b. 

The following lemma gives a second lower bound on the running time of any 
algorithm using shortest path routing on full-duplex hexagonal grids. 

Lemma 4.5 (Second lower bound) The worst-case running time of any algo- 
rithm using shortest path routing on full- duplex hexagonal grids satisfies: 



where d 



Running time > 

V73c+64lg mx + 1 21 + 144:1 ~ 
Sy/c+1 



max{£, k}(2d + — — -) 
1 ;v 2d+ V 



and c 



max{^,fc} 
min{£,fc} 



I 

performance approximately twice better than in triangular grids. 



Notice that when is big, this value tends to 2 max{^, fc}-^=, obtaining a 



Proof. The proof consists in showing that the vertices of R^ exd d can simultaneously 
send max(£, k) messages to some vertices of Rlexd+e^d+l^ \ R hexd,d- This is done 
as for the triangular grid, using again Theorem 14.11 We do not give all the details, 
since the idea behind is the same as the proof of Lemma l4.2i 

Since the number of vertices inside R v hexd d is Ad 2 + d — 2, and the number of edges 

outgoing from R v hexd d is 2d+ 1, the congestion on these edges is max{£, fc} 4 ^^" 2 = 
m&x{£,k}(2d + □ 
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5. Conclusions and Further Research 

In this article we have studied the permutation routing, the r-central routing and 
the general (£, fc)-routing problems on plane grids, that is square grids, triangular 
grids and hexagonal grids. We have assumed the store- and- forward A-port model, 
and considered both full and half-duplex networks. The main new results of this 
article are the following: 



1. Tight (also including the constant factor) permutation routing algorithms on 
full-duplex hexagonal grids, and half duplex triangular and hexagonal grids. 

2. Tight (also including the constant factor) r-central routing algorithms on 
triangular and hexagonal grids. 

3. Tight (also including the constant factor) (k, fc)-routing algorithms on 
square, triangular and hexagonal grids. 

4. Good approximation algorithms for (£, A;)-routing in square, triangular and 
hexagonal grids, together with new lower bounds on the running time of any 
algorithm using shortest path routing. 



All these algorithms are completely distributed, i.e., can be implemented indepen- 
dently at each node. Finally, in Appendix B we have formulated the {£, fc)-routing 
problem as a Weighted Edge Coloring problem on bipartite graphs. 

There still remain several interesting open problems concerning (£, fc)-routing on 
plane grids. Of course, the most challenging problem seems to find a tight (£, k)- 
routing algorithm for any plane grid, for £ ^ k. Another interesting avenue for 
further research is to take into account the queue size. That is, to devise (£, k)- 
routing algorithms with bounded queue size, or that optimize both the running 
time and the queue size, under a certain trade-off. 
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Appendix A. Denning the embeddings 

The results already known for the square grid can be used for a triangular (resp. 
a hexagonal) grid if we have an adapted function mapping the square grid into 
the triangular (resp. hexagonal) grid. Here we propose both functions, namely 
square2triangle and square2hexagon. 

The function square2triangle is illustrated in Fig. [7J We perform the same rout- 
ing as in the grid, i.e. we just ignore the extra diagonal. 



Fig. 7. Square grid mapped into the triangular grid. 

In a square grid the distance between two vertices is at most twice the distance 
in a triangular grid. Similarly the congestion is at most doubled going from the 
triangular grid to the square grid. Nevertheless the maximal distance is unchanged. 
Indeed, the NW and SE nodes of Fig. [7J are at the same distance in both grids. 
Consequently, an algorithm which routes a permutation in 2n— 2 steps is still optimal 
in the worst case. If instead of considering a square grid with one extra diagonal, 
we look at a triangle grid as on Fig. [U or in shape of a triangle, Using the routing 
of the square grid in this triangle grid yields a routing within twice the optimal (i.e. 
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minimum time in the worst case). 




Fig. 8. Triangular grid. 



The function square2hexagon is a little more complicated. Squares are mapped 
in two different ways on the hexagonal grid, as shown in Fig. [9j Some are mapped 
on the left side of a hexagon and some on the right side. Call them respectively 
white and black squares. White and black squares alternate on the grid like white 
and black on a chess board. The missing edge of a white square is mapped to the 
path of length 3 that goes on the right of the hexagon. The missing edge of a black 
square is mapped on the path of length 3 that goes on the right of the hexagon. 
In this way each edge of the square grid is uniquely mapped and each edge of the 
hexagonal grid is the image of exactly two edges of the square grid. 

The distance between 2 vertices in the hexagonal grid is twice the distance in 
the square grid plus one. Also when we adapt a routing from the square grid to 
the hexagonal grid using the function square2hexagon, the congestion may double 
since each edge of the hexagonal grid is the image of two edges of the square grid. 
Consequently, a routing obtained using the function square2hexagon will be within 
a constant multiplicative factor of the optimal. 

Appendix B. An Approach for (£, fc)-routing Using Weighted 
Coloring 

In any physical topology, we can represent a given instance of the problem in the 
following way. Given a network on n nodes, we build a bipartite graph H with a 
copy of each node at both sides of the bipartition. We add an edge between u and 
v whenever u wants to send a message to v, and assign to each edge uv a weight 
w(uv) equals to the length of a shortest path from u to v on the original grid. In 
this way we obtain an edge-weighted bipartite graph H on In nodes. Note that the 
maximum degree of H satisfies A < m&x{£, k}. An example for £ = 2 and k = 3 is 
depicted in Fig. HTJ1 

The key idea behind this construction is that each matching in H corresponds 
to an instance of a permutation routing problem. Hence, it can be solved optimally, 
as we have proved for all types of grids in Section [2j For each matching Mi, we 
define its cost as c(Mj) := max{u>(e)|e S Mi)}. We assign this cost because on all 
grids the running time of the permutation routing algorithms we have described are 
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Fig. 9. Square grid mapped into the hexagonal grid. 

proportional to the length of the longest shortest path (with equality on full-duplex 
triangular grids). 
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1=2 k=3 




Fig. 10. Bipartite graph modeling a (2, 3)-routing instance. 

^,From the classical Hall's theorem we know that the edges of a bipartite graph 
can be partitioned into A disjoint matchings (that is, a coloring of the edges), A 
being the maximum degree of the graph. In our case we have A = max{£, k}. Thus, 
the problem consists in partitioning the edges of H into A matchings M\ , . . . , Ma j m 
such a way that X/j=i c i^i) is minimized. That is, our problem, namely Weighted 
Bipartite Edge Coloring, can be stated in the following way: 

Weighted Bipartite Edge Coloring 
Input: An edge- weighted bipartite graph H. 

Output: A partition of the edges of H into matchings Mi, . . . , Ma, with c(Mj) := 
max{w;(e)|e € Mi)}. 
Objective: min^^ =1 c(Mj). 

Therefore, min^^ =1 c(Mj) is the running time for routing an (£, &)-routing in- 
stance using this algorithm. 

Unfortunately, inM Weighted Edge Coloring is proved to be strongly NP- 
complete for bipartite graphs, which is the case we are interested in. In fact, the 
problem remains strongly NP-complete even restricted to cubic and planar bipartite 
graphs. Concerning approximation results, the authors provide an inapproxima- 
bility bound of g — e, for any e > 0. Furthermore, they match this bound with an 
approximation algorithm within 7/6 on graphs with maximum degree 3, improving 
the best known approximation ratio of 5 /3 ^1. In ^31 this innaproximability bound 
is proved independently on general bipartite graphs. Thus, if max{£, k} < 3 we 
can find a solution of Weighted Bipartite Edge Coloring within | times the 
optimal solution, and this will be also a solution for the (£, A:)-routing problem. 

Remark Appendix B.l. Although of theoretical value, the main problem of this 
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algorithm is that finding these matchings is a centralized task. In addition, the 
true ratio, i.e. related to the optimum of the (£, fc)-routing, should be proved to 
provide an upper bound for the running time of this algorithm. 
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